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ABSTRACT 

We  present  an  optimal  method  for  storing  a  sequence  of  similar  lists  that  permits  rapid 
searching.  Applications  include  an  optimal  planar  point  location  algorithm,  more  general  than  pre- 
vious ones,  and  a  3-dimensional  point  location  algorithm. 


1.  Introduction 

We  consider  the  following  problem.  We  are  given  a  linear  sequence  of  k  lists,  each  of  n 
items,  with  adjacent  lists  being  similar;  more  precisely,  the  sum  of  the  differences  between  adja- 
cent lists  is  h.  No  two  adjacent  lists  are  identical,  but  non  adjacent  lists  may  be  identical.  We 
provide  a  data  structure  for  storing  the  lists  on  which  the  following  operations  can  be  carried  out: 

(a)  Fmd  the  ith  largest  item  in  the  jth  list,  taking  time  O(logn). 

(b)  List  the  items  between  the  ith  and  the  jth  in  the  Ith  list,  taking  time  0(J  —  i  +  \ogn). 

We  present  two  solutions;  a  simple  one  requiring  space  0(n  +  h\ogn)  and  a  more  complex  version 
requiring  space  0(n  +  h). 

This  problem  arose  in  the  work  on  the  geometric  retrieval  problem  by  Cole  and  Yap  [CY], 
where  one  wants  to  solve  a  variant  of  problem  (b)  as  efficiently  as  possible.  More  generally,  in 
computational  geometry  it  is  frequently  the  case  that  one  wants  to  store  many  similar  lists,  though 
sometimes  only  needing  to  carry  out  a  simpler  version  of  operation  (b),  namely,  to  output  all  the 
items  in  a  given  list  [L,BM].  It  is  an  issue  which  arises  in  graphics  also,  where  it  is  termed  coher- 
ence; the  problem  is  to  take  advantage  of  many  lists  being  similar  in  carrying  out  processing  on  the 
several  lists  [SSS]. 

Chazelle  [Chi]  studies  a  similar  problem,  which  is  presented  in  a  rather  different  setting.  He 
considers  an  ordered  set  S  of  n  items  V;,v2,...,v,  subject  over  time  to  h  insertions  and  deletions. 
He  wishes  to  build  a  data  structure  to  allow  the  following  query  to  be  answered  quickly:  determine 
which  item  v,  immediately  preceded  an  object  q  at  time  9;  he  calls  this  item  the  neighbour.  He 
presents  a  data  structure  that  requires  storage  0(n  +  h)  and  allows  the  computation  of  a  neighbour 
in  time  0(logn  \ogh).   Further,  the  set  S  need  only  be  partially  ordered. 

We  show  this  problem  can  be  solved  using  essentially  the  solution  to  the  list  problem.  We 
obtain  a  solution  that  requires  space  0{n  +  h)  and  allows  the  finding  of  a  neighbour  in  time 
0(logn  +  log/i).   Our  solution,  while  more  efficient,  is  very  similar  to  Chazelle's  solution. 


We  have  several  applications.  We  give  a  planar  point  location  algorithm  requiring  space 
0(n)  and  having  a  query  time  of  0(log/i)  that  is  more  general  that  previous  solutions.  We  also 
present  a  3-dimensional  point  location  algorithm.  And  we  give  an  improved  representation 
scheme  for  polyhedra,  enabling  one  to  improve  an  existing  polyhedra  intersection  algorithm. 

In  the  next  two  sections  we  present  the  solutions  to  the  list  problem.  In  section  4,  we  show 
how  to  reduce  Chazelle's  problem  to  a  list  problem.  Then,  in  section  5  we  present  applications 
and  tn  section  6  we  conclude. 

2.  A  First  Solution 

To  make  the  presentation  simpler  we  assume  adjacent  lists  differ  on  exactly  one  point.  We 
note  we  can  always  achieve  this  by  adding  additional  lists.   So  h  =  k+l. 

It  is  convenient  to  think  of  the  lists  lying  in  an  array,  the  ith  list  being  the  ith  row  of  the 
array.  Each  list  is  stored  in  the  natural  way  in  a  balanced  binary  tree;  that  is,  the  median  item  (the 
\nJ2\th  item)  is  stored  at  the  root,  the  sublist  to  the  left  of  the  median  is  recursively  stored  in  the 
left  subtree,  and  the  sublist  to  the  right  of  the  median  is  recursively  stored  in  the  right  subtree. 

Thus,  we  associate  a  tree  with  each  row  of  the  matrix.  It  may  be  the  case  that  a  sequence  of 
adjacent  trees  have  identical  left  (right)  subtrees.  Then  we  store  only  one  copy  of  the  identical 
left  (right)  subtrees,  all  the  trees  in  question  sharing  the  same  left  (right)  subtree.  This  space  sav- 
ing is  repeated  recursively,  separately  for  the  left  and  right  subtrees. 

1i  two  identical  left  (right)  subtrees  are  separated  by  a  distinct  subtree  then  we  will  keep  two 
copies  of  the  identical  subtrees.  This  does  not  affect  the  current  solution,  but  is  important  for  the 
improved  solution  below. 

Suppose  we  store  u  left  subtrees  and  v  right  subtrees  for  the  k  trees  associated  with  the  k 
lists.  An  alternative  view  is  that  the  u  left  subtrees  represent  u  lists  of  length  [nH^-l  (the  left 
sublists),  with  the  sum  of  the  differences  between  the  lists  being  u-l  =  r,  and  the  v  right  subtrees 
represent  v  lists  of  length  n-  {nH^  (the  right  sublists),  with  the  sum  of  the  differences  between 


adjacent  lists  being  v - 1  =  j .   We  note  r+s-^h. 

We  can  still  perform  a  binary  search  on  any  of  the  lists  in  time  O(logn). 
The  storage  S(h,n)  is  given  by: 

For  A^l,  S(h,n)  ^  0(h)  +  S{r,na)  +  S{s,nri) 

and  5(0,/i)  =  0{n). 

The  solution  is  S{h,n)  =  0{n  +  h\ogn). 

3.  A  Second  Solution 

The  next  attempt  at  a  solution  gives  up  the  nice  recursive  structure  we  had  above.  We 
return  to  the  matrix  analogy.  For  each  column  (items  of  the  same  rank),  for  each  sequence  of 
adjacent,  identical  items,  we  store  exactly  one  copy.  Again,  if  two  identical  items  of  the  same 
rank  are  separated  by  a  distinct  item  we  store  two  copies  of  the  identical  items. 

To  be  able  to  determine  in  which  lists  (rows)  an  item  lies  we  need  to  store  the  span  of  an 
item,  together  with  its  value.  The  span  of  an  item  is  merely  those  lists  in  which  the  item  lies; 
recall  that  a  list  is  identified  by  its  row  index.  Thus  eadi  item  becomes  a  triple  <v,l,u>,  v  being 
its  value,  and  /  and  u  being  the  lower  and  upper  bounds,  respectively,  for  its  span. 

Each  column  of  the  matrix  is  stored  as  a  binary  tree.  The  space  requirement  is  0(n  +  /j),  the 
search  time  for  operation  (a)  is  0{\ogh),  but  we  are  unable  to  perform  operation  (b)  efficiently, 
for  there  is  no  link  between  adjacent  columns. 

It  is  helpful  to  think  of  the  first  solution  as  a  single  search  tree,  with  each  node  of  the  tree 
storing  a  column  of  the  matrix.  Next,  we  consider  the  links  between  the  items  stored  in  a  parent 
node  of  the  search  tree  and  the  items  stored  in  the  left  (right)  cMd.  With  each  item  in  the  parent 
node  we  have  a  pointer  to  the  item  in  the  same  list,  or  row,  in  the  left  (right)  child. 


Now,  we  add  the  data  compression  described  above:  "that  is,  for  each  column,  for  each 
sequence  of  identical,  adjacent  items  we  store  exactly  one  copy,  and  with  each  item  we  record  its 
span.  Next,  we  describe  the  pointers  we  add  to  allow  the  search  to  proceed  from  one  level  of  the 
search  tree  to  the  next.  Let  us  consider  just  one  pair  of  parent'child  nodes  in  the  search  tree. 
Suppose  the  items  stored  at  the  parent  node  have  spans  </,,«;>,  <l2,U2>,  ...,  <l^,u^>,  with 
/,_-=M,  +  l,  for  j  =  l,2,...,r-l,  and  the  items  stored  at  the  child  have  spans  <j-^,t-^>,  <J2,ti>,  ..., 
<j^,t^>,  with  ;,+i  =  f,  +  l,  for  i=l,2,...,s-l.  GDnsider  the  span  </,,m,>.  Suppose  it  overlaps  with 
the  spans  <y,^.,f^^->,  <j^^2''x^2^'  •••'  <A*i''v*i>;  ^^  is,  7\^.^/,<f^^,,  and  ;\+4<M,<fj+4. 
We  will  refer  to  these  k  spans  as  the  range  of  </,,u,>.  Notice  that  the  ranges  corresponding  to 
two  adjacent  spans  (in  the  parent)  overlap  in  at  most  one  of  their  extremal  spans  (in  the  child). 
Thus  we  can  store  each  range  containing  two  or  more  spans  as  a  balanced  binary  tree;  these  trees 
may  share  their  leftmost  and  rightmost  leaves  with  the  trees  for  neighbouring  ranges.  A  range 
consisting  of  a  single  span  is  stored  as  the  one  item.  Each  span  </,,",>  is  given  a  pointer  either 
to  the  root  of  the  tree  holding  its  range,  or  to  the  single  span  forming  the  range. 


For  example: 

Suppose  that  at  the  parent  level  we  have  spans: 

<1,10>,<11,12>,<13,14>,<15,20> 

and  at  the  chUd  level  we  have  spans: 

<1,3>,<4,7><8,15>,<16,17>,<18,20>. 

We  then  obtain  the  following  structure: 


<1,10>      <1 

1,12>       <13,14>       <15,20> 

/      ^^ 

/      <16,17> 

uV\  - 

<8,15>                  <18,20> 

<4,7> 

Key:  /  range  tree  edge, '^pointer  from  one  level  of  search  tree  to  next. 

The  space  used  by  this  structure  is  0(n  +  h).   To  carry  out  operation  (a)  we  have  to  search 
up  to  logn  levels  of  the  search  tree,  and  at  each  level  we  have  to  search  a  range  which  may  include 


as  many  as  0(h)  spans.  So  the  search  at  one  level  takes  OQogh)  time,  giving  an  overall  search 
time  of  C>(log/ilog/i).  We  still  have  difficulty  carrying  out  operation  (b),  the  problem  being  that  it 
takes  0(log/i)  time  to  go  from  one  level  of  the  search  tree  to  the  next,  and  hence  it  takes  time 
0(lognlog/i+jlog/i)  to  list  5  items. 

We  make  the  data  structure  more  efficient,  thereby  allowing  operation  (b)  to  be  performed 
efficiently.  Our  difficulties  arose  because  the  range  corresponding  to  a  span  in  the  parent  level 
may  include  many  spans  (in  the  child  level).  We  avoid  this  problem  by  introducing  additional 
items  at  the  parent  level.  If  a  range  has  Zik+l  spans  we  create  *+l  items  at  the  parent  level  to 
replace  the  single  item  we  had  previously.  So  we  inaease  the  number  of  items  by  k.  Suppose  the 
original  span  at  the  parent  level  was  <l,u>,  and  the  range  consisted  of  the  spans  <ji,ti>, 
<ji,f-i>,  ...,  <Jn,+:>'ik-i-^-  Thei^  ^s  nsw  items  all  have  the  same  data  value  as  that  of  the  item 
they  replace,  and  they  have  spans  </,r;>,  <73,/4>,  ...,  <J2i-i>'2i><  •••>  <J2jc*u">-  ^  ^  range 
has  2k  spans  we  aeate  k  items  at  the  parent  level,  inaeasing  the  number  of  items  hy  k-1.  Sup- 
pose the  span  at  the  parent  level  was  <t,u>,  and  the  range  consisted  of  the  spans  <J.,t.>, 
<J2,':>.  •••,  <J2i'^ik^-    T^^^  ^e  new  items  have  spans  </,r;>,  <;'3,r4>,  ...,  <J2i-i'hi^'  •••' 

We  carry  out  this  process  for  each  node  in  the  search  tree,  first  with  respect  to  the  left  child 
and  then  with  respect  to  the  right  child.  We  proceed  from  the  leaf  level  to  the  root  of  the  search 
tree,  level  by  level. 

^Tien  carrying  out  a  search,  in  going  from  an  item  at  the  parent  level  to  the  left  (right)  child 
level,  we  have  at  most  two  items  in  the  range.  Thus  this  step  of  the  search  takes  0(1)  time.  The 
search  is,  at  most,  repeated  at  each  of  the  logn  levels  of  the  tree,  giving  a  search  time  of  O(logn). 
Also,  we  need  to  start  the  search  at  the  root.  But  we  have  at  most  h  items  at  the  root.  Hence 
starting  a  search  at  the  root  takes  at  most  O(logA)  time,  so  Lhe  overall  search  time  (operation  (a)) 
is  0(]ogn  +  ]ogh).  Likewise,  operation  (b)  takes  time  0(]ogn  +  ]ogh  +  s),  where  s  is  the  number 
of  items  listed. 


It  remains  to  show  the  size  of  the  search  tree  is  still  0(n  +  ;»). 

Lemma:  The  size  of  the  seach  tree  described  above  is  0(n  +  h). 

Proof:  For  accounting  purposes,  we  will  no  longer  count  the  rightmost  span  as  being  part  of  a 
range.  This  reduces  the  size  (in  spans)  of  each  range  by  one  and  ako  ensures  the  ranges  are  dis- 
joint. If  a  range  has  2k-l  spans  (2k  previously)  we  create  k  items  at  the  parent  level  to  replace 
the  one  we  had  before.  So  we  inaease  the  number  of  items  by  fc-1.  Likewise  if  a  range  has  2k 
items  (2k  + 1  previously)  we  increase  the  number  of  items  by  k.  We  deduce  the  increase  in  the 
number  of  items  at  the  parent  level  is  at  most  half  the  number  of  items  at  the  child  level. 

The  new  items  we  introduce  may  themselves  contribute  to  the  aeation  of  further  new  items, 
but  only  in  the  ratio  of  two  for  one.  Suppose  there  is  an  increase  of  i  items  in  the  search  tree.  So 
we  have  n  +  h  +  i  items  in  the  search  tree  (we  had  n  +  ;i  items  before  we  started  increasing  their 
number).  Since  it  takes  two  items  to  cause  an  increase  of  one  item  we  have  2i  ■<  n  +  h  +  i.  So 
i  <  n  +  h,  and  the  size  of  the  search  tree  is  still  Cn  +  h).  □ 

The  technique  of  introducing  items  at  a  rate  of  one  for  two  is  a  combination  of  a  technique 
of  Willard's  [W]  and  a  technique  of  Chazelle's  [Ch2]. 

4.  Finding  Neighbours 

We  show  how  to  solve  Chazelle's  problem  [Chi]  using  the  data  structure  desdibed  above. 
Fu-st,  we  state  the  problem  precisely,  let  S  =  {v.,v.,...,v„}  be  a  set  of  n  objects,  with  a  partial 
order  R.  Each  object  is  either  active  or  inactive  according  to  the  value  of  a  parameter  9;  we  let  9 
take  values  in  the  set  of  real  numbers,  and  we  refer  to  9  as  the  time.  Define  the  active  set  at  time 
9^  5(9)  =  {v,.|v,  active  at  9}.  We  assume  we  only  have  finitely  many  sets  5(9).  We  also  assume 
the  partial  order  R  is  a  total  order  on  each  of  the  sets  5(9).  We  define  a  query  to  b^  a  pair  (^,9), 
where  q  is  an  object  (not  necessarily  in  5),  which  "extends"  the  total  order  on  5(9),  i.e.  the  out- 
come of  "q^vT  for  each  v€5(9)  does  not  contradict  the  total  order  on  5(9).    The  outcome  is 


meaningless,  however,  for  vlS(Q).  Define  the  hwer  neighbour  of  <?  as  the  largest  object  v  in  S(9) 
such  that  v<q,  and  the  upper  neighbour  of  ^  as  the  smallest  object  v  in  5(9)  such  that  v>q.  The 
response  to  the  query  (^,6)  is  the  lower  neighbour  of  q  at  time  9.  The  problem  is  to  preprocess  S 
so  that  the  lower  neighbour  can  be  computed  efficiently. 

We  write  the  set  5(9)  as  the  list  L(.,9),  with  L(i,9)  =  v,  if  v,€5(9),  and  Z,(i,9)  =  absent  oth- 
erwise. The  h  deletions  and  insertions  to  the  set  S  correspond  to  the  sum  of  the  differences 
between  adjacent  lists. 

Given  a  query  (^,9)  we  want  to  perform  a  search  on  the  list  Z, (.,9).  We  would  like  to  per- 
form a  binary  search  as  it  is  fast.  However,  it  may  happen  that  there  is  no  item  stored  at  the 
median  location,  for  example,  and  then  we  are  unable  to  start  the  search.  So  we  alter  the  posi- 
tions of  some  items  in  the  lists  Z.(.,9)  to  allow  a  binary  search  to  be  carried  out.  We  call  this  a 
demotion  strategy  because  any  item  that  is  moved  receives  a  lower  rank  than  it  had  before. 

To  facilitate  our  explanation  of  the  demotion  strategy  we  imagine  the  list  to  be  stored  in  the 
natural  way  in  a  binary  tree  (the  item  at  the  median  location  at  the  root,  the  items  before  it  in  the 
left  subtree,  and  these  after  in  the  right  subtree).  The  demotion  is  carried  out,  level  by  level, 
working  from  the  root  to  the  leaves,  using  the  following  rule  for  the  location  with  rank  j.  If  there 
is  no  item  of  rank  j,  we  move  the  item  of  lowest  rank,  if  any,  among  the  items  originally  in  j's 
right  subtree,  to  the  location  of  rank  j,  if  this  item  has  not  been  moved  already;  if  it  has  already 
been  moved,  we  move  no  item. 

Let  us  consider  a  binary  search  on  the  list  Z,(.,9),  following  the  demotions.  If,  in  the  course 
of  the  search,  we  compare  v  to  the  item  stored  at  the  location  with  rank  j  then  we  branch  accord- 
ing to  the  results  of  the  comparison.  If  there  is  no  item  stored  at  this  location  there  are  two  possi- 
bilities: 

(a)     some  item  /  originally  in  j's  right  subtree  was  demoted  out  of  the  subtree  rooted  at  j\  then 
we  want  to  branch  to  j's  right  subtree. 


(b)     there  was  no  item  in  y"s  right  subtree;  then  we  want  to  branch  to  j's  left  subtree. 
By  recording  its  old  rank  with  each  item  we  are  able  to  determine  whether  case  (a)  or  case  (b) 
applies.   For  if  case  (a)  applies  we  will  have  compared  v  to  the  demoted  item  i  before  seeking  an 
item  at  the  location  with  rank  j,  and  if  case  (b)  applies  we  will  not  (there  not  being  such  an  item 
i).  Thus  we  are  able  to  carry  out  a  binary  search. 

We  now  calculate  how  different  two  adjacent  lists  become  as  a  result  of  the  demotion  stra- 
tegy. We  observe  that  for  each  change  (insertion,  deletion)  between  adjacent  lists  there  are  at 
most  two  differences  in  the  results  of  the  demotion  strategy:  one  for  the  item  inserted  or  deleted, 
and  one  for  its  upper  neighbour.  Thus  the  sum  of  the  differences  between  adjacent  lists,  after  the 
demotions  have  occurred,  is  still  0(h). 

We  store  the  lists  L(.,e),  following  the  demotions,  using  the  data  structure  described  in  sec- 
tion 3.  We  can  find  a  lower  neighbour  in  time  0(\ogn  +  \ogh),  an  improvement  on  Chazelle's 
time  bound  of  0(log/i!og/j). 

Careful  inspection  reveals  that  the  data  structure  we  build  is  a  refinement  of  a  structure  very 
similar  to  the  canal  trees  introduced  by  Chazeile  [Chi]  for  the  same  problem.  In  fact,  the  very 
similar  structure  is  the  one  with  a  query  time  of  0(\ogn\ogh),  obtained  by  using  the  first  stage  of 
our  second  solution  to  store  the  lists  I(.,6). 

We  now  consider  the  question  of  building  the  data  structure  in  an  efficient  way,  without  con- 
structing all  h  lists,  for  this  would  take  time  at  least  0{nh).  First,  we  perform  a  topological  sort 
on  the  items  to  obtain  a  total  order  imbedding  R,  the  given  partial  order.  Suppose  we  are  given 
5(9.)  and  a  list  of  the  changes  from  5(9,)  to  5(9,^,),  for  each  value  of  i.  We  store  5(9^)  as  a  bal- 
anced tree;  by  performing  a  sweep  across  time,  inserting  and  deleting  items  as  appropriate,  as  we 
obtain  the  sets  5(9,)  we  compute  the  changing  ranks  of  the  locations  of  each  item,  including  the 
effects  of  the  demotion  strategy.  Retiuiiing  to  the  matrix  analogy,  the  list  Z,(.,9,)  corresponds  to 
row  J.    We  have  computed  each  column  of  the  matrix,  but  we  have  only  recorded  an  item  in  a 


c»lumn  each  time  there  is  a  change  in  the  value  of  the  item  stored  in  that  column.  It  remains  to 
add  the  extra  items  that  enable  searches  to  proceed  efficiently.  This  can  be  done  by  a  sweep  of 
the  search  tree,  level  by  level,  from  the  leaves  to  the  root;  it  takes  time  0{n  +  h).  So  the  overall 
time  taken  to  build  the  search  tree  is  0(n  +  hlogn). 

S.  Applications 

5.1.  Planar  Point  Location 

We  are  given  a  planar  graph  with  straight  edges.  Each  vertex  is  specified  by  its  two  coordi- 
nates and  each  edge  by  its  endpoints.  The  problem  is  to  build  a  data  structure  to  permit  the  rapid 
answering  of  queries  of  the  form:  'in  which  region  does  a  given  point  lie?'.  This  problem  has  been 
studied  by  many  authors  including  [DL,LP,P,K,Chl,LT].  The  complexity  is  usually  measured  as  a 
space-time  pair:  Lhe  space  for  the  data  structure  and  the  time  to  answer  a  query.  The  first  optimal 
solution  (complexity  0(n,\ogn))  was  discovered  by  Lipton  and  Tarjan  [LT];  however  their  solution 
was  mainly  of  theoretical  interest  as  it  was  very  complicated.  Subsequently  Kirkpatrick  [K] 
discovered  a  much  simpler  optimal  solution;  however  it  still  requires  that  the  graph  be  triangulated 
and  there  are  levels  of  recursion  in  his  data  structure,  which  keep  the  solution  from  being  very 
simple.  We  present  another  optimal  solution,  which  is  simple  and  does  not  require  the  triangula- 
tion  of  the  graph.  Our  solution  is,  in  some  sense,  an  efficient  implementation  of  an  early  solution 
to  this  problem  by  Dobkin  and  Lipton  [DL];  Preparata  [P]  gives  a  solution  which  resembles  the 
one  we  give;  but  it  is  not  optimal  and  it  is  much  more  complicated  to  describe.  The  early  solution 
and  Preparata's  have  the  advantage  that  they  do  not  require  the  assumption  that  the  graph's  edges 
be  straight  lines;  nor  does  our  solution. 

Now  for  the  solution.  We  simply  define  a  partial  order  on  the  edges  of  the  graph.  Let  e 
and  /  be  edges  of  the  graph.  We  define  e<f  \i  there  is  a  line  x=c  which  intersects  both  e  and  /, 
and  it  intersects  e  at  a  smaller  value  of  y.  Time  is  taken  to  be  the  x-coordinate.  We  use  the  data 
structure  for  the  neighbour  problem.   The  sets  5(9)  change  at  the  x  values  of  the  vertices  of  the 
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graph.  This  solution  has  complexity  (n,logn).  This  reduction  was  first  described  by  Chazelle 
[Chi],  where  he  gave  an  algorithm  having  complexity  0(n,log-«). 

We  now  relax  the  straight  edge  condition.  We  notice  that  the  solution  given  above  merely 
requires  that  each  edge  be  intersected  at  most  once  by  any  line  x=c.  Suppose  the  edges  are  circu- 
lar arcs.  We  introduce  new  vertices  at  those  points  of  the  arcs  that  have  vertical  tangents;  the  only 
candidates  are  the  leftmost  and  rightmost  points  of  the  arcs.  This  introduces  at  most  two  new  ver- 
tices per  arc,  and  at  most  triples  the  number  of  edges.  We  can  now  use  the  partial  order 
described  above,  obtaining  a  solution  with  complexity  0(n,\ogn),  which  is  optimal.  The  previous 
solution  to  this  problem  had  complexity  0(nlogn,logn)  [P].  This  problem  was  originally 
motivated  by  the  fixed  radius  circle  retrieval  problem  [BM]. 

We  are  also  able  to  improve  on  a  'general'  planar  point  location  algorithm.  The  problem 
was  posed  in  [EM].  Define  a  region  thus.  A  region  ^  is  a  simply  connected,  closed  and  bounded 
subset  of  the  Euclidean  plane  such  that 

(a)  any  line  parallel  to  the  y-axis  intersects  i?  in  at  most  two  points  or  is  a  tangent  line 

(b)  in  time  0(1)  we  can  find  a  left  endpoint  of  R,  denoted  by  l(R)  and  a  right  endpoint  of  R, 
denoted  by  r(R). 

(c)  if  ;7  is  any  point  with  x-coordinate  lying  between  /(/?)  and  r(R),  then  in  0(1)  time  we  can 
determine  whether  p  is  in,  below,  or  above  R . 

An  0(n,log^n)  algorithm  was  given  in  [EM]. 

We  use  the  solution  described  above  for  the  graph  with  circular  arcs,  which  gives  us  an  0(n,logn) 

point  location  algorithm. 

£.2.  3-Dimeiisional  Point  Location  Algorithm 

We  use  the  algorithm  due  to  Dobkin  and  Lipton  [DL],  implementing  it  with  a  smaller  space 
requirement.   They  state  the  problem  thus:  given  n  infinite  planes,  preprocess  so  as  to  be  able  to 


quickly  determine  in  which  region  an  arbitrary  point  lies.   Note  that  there  are  0(n^)  regions. 

The  solution  goes  as  follows.  Project  the  lines  defined  by  intersections  of  pairs  of  planes 
onto  the  plane  z=0;  there  are  0(n-)  such  lines.  These  lines  define  0(n'*)  regions  on  the  plane 
z=0.  Each  region  is  the  projection  of  a  vertical  'cylinder'  in  the  3-dimensional  space.  On  travers- 
ing such  a  cylinder  from  bottom  to  top  we  cross  the  n  planes  in  a  specific  order,  which  we  store  in 
a  list.  The  lists  corresponding  to  adjacent  cylinders  will  disagree  on  exactly  those  planes  that  meet 
in  a  line  lying  in  the  separating  plane  (the  plane  that  separates  the  two  cylinders). 

We  now  have  the  problem  of  storing  0{n*)  similar  lists,  each  of  length  n;  however  they  do 
not  lie  in  a  linear  sequence,  as  is  assumed  by  the  methods  given  above.  This  is  not  insurmount- 
able. Consider  the  graph  defined  by  the  lines  projected  onto  the  plane  z-0.  We  look  at  the  dual 
graph  which  has  a  vertex  for  each  region,  and  an  edge  joining  each  pair  of  vertices  corresponding 
to  adjacent  regions.  We  take  a  spanning  tree  of  this  graph  and  then  form  an  Euler  circuit  of  the 
spanning  tree  (each  edge  is  traversed  once  in  both  directions).  We  form  the  sequence  5  of  ver- 
tices in  the  Euler  circuit.  It  has  length  0(n'^)  and  every  vertex  occurs  at  least  once  in  S.  Recall 
that  to  each  vertex  there  corresponds  a  region,  and  for  each  region  there  is  a  list.  We  let  the 
sequence  of  lists  be  given  by  the  corresponding  sequence  of  vertices.  Since  adjacent  vertices 
correspond  to  adjacent  regions,  so  do  adjacent  lists.  The  sum  of  the  differences  between  adjacent 
lists  in  the  sequence  we  obtain  is  0{n'*).  Using  our  second  method  we  store  these  lists  in  space 
0(/i'*),  while  having  a  search  time  of  O(logn). 

To  complete  the  algorithm,  v/e  need  a  method  for  determining  in  which  cylinder  a  point  lies. 
By  projecting  onto  the  plane  z=0  this  becomes  a  planar  point  location  algorithm,  for  a  planar 
straight  edge  graph  of  size  0  («■*).  We  can  use  the  algorithm  described  above,  for  example.  It 
requires  space  0(n'*)  and  has  a  query  time  of  O(logn).  So  the  overall  complexity  of  the  algorithm 
is  0(n"',logrt),  improving  on  the  0(n^,\ogn)  of  [DL].  Chazelle  [Chi]  gives  a  solution  having  com- 
plexity 0(n^,hg~n).   Recall  that  there  are  0(n^)  possible  answers. 


The  problem  can  be  formulated  more  generally:  given  a  set  of  finite  or  infinite  plane  faces 
dividing  a  3-dimensional  space  into  regions,  preprocess  so  as  to  be  able  to  quickly  determine  in 
which  region  an  arbitrary  point  lies.  We  assume  each  face  is  given  as  a  series  of  edges,  that  pairs 
of  faces  intersect  only  in  their  edges  or  their  vertices,  and  that  the  size  of  the  problem  in  this 
description  is  n.  We  follow  the  method  outlined  above.  But  in  storing  the  lists  we  notice  they 
may  have  variable  length,  so  we  convert  this  to  a  neighbour  problem.  We  consider  the  0{n-)  lists 
to  be  a  list  varying  over  time  (time  being  given  by  the  order  of  the  lists  derived  from  the  Euler 
circuit).  We  have  a  universe  of  up  to  n  objects  (the  faces)  and  a  partial  order  is  defined  by  their 
order  in  the  lists.  Note  that  as  time  varies  a  given  face  may  be  added  and  deleted  several  times 
from  the  ordered  list  of  faces.  Now  we  use  the  solution  for  the  neighbour  problem,  which  gives 
an  algorithm  having  complexity  0(n-,logn). 

Notice  that  we  could  use  the  first  method  for  storing  lists  to  obtain  simpler  solutions  to  the 
planar  and  3-dimensional  point  location  problems,  having  complexities  0(/ilogn,logn)  and 
0(n-log/i,logn)  respectively. 

5.3.  Semi-Rectangular  Range  Queries 

The  problem  is  the  following.  Qven  a  set  of  n  points  in  the  plane,  preprocess  so  as  to 
respond  to  queries  <Xi,X2,z>  by  listing  those  points  (x,y)  such  that  jt,<jr<x,  and  z^y,  in  time 
0{\ogn  +  s),  where  s  is  the  number  of  points  listed.  McCreight  [M]  has  given  a  solution  requiring 
space  0(n)  for  a  dynamic  version  of  this  problem.  We  only  provide  a  solution  for  the  static  ver- 
sion. Nonetheless,  we  think  our  solution  is  of  interest  as  it  provides  another  way  of  looking  at  the 
problem. 

We  sort  the  n  points  by  increasing  y-coordinate,  breaking  ties  arbitrarily:  Pi<;»2<  '  '  '  ^Pn< 
say.  We  ceate  n  lists  L,  ,L2,...,L„,  where  Z,,  contains  the  points  p^,p2,...,Pi  sorted  by  j: -coordinate. 
To  respond  to  a  query  we  find  the  largest  value  of  i  such  that  all  the  y-coordinates  of  points  in  L, 
are  bounded  by  z;  we  then  output  those  points  in  I,  whose  x-coordinates  lie  in  the  given  range. 
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We  could  store  the  lists  using  the  data  structure  described  in  section  3.  However,  we  would 
have  problems  in  listing  items,  as  there  could  be  many  locations  in  a  list  storing  no  item,  and  we 
might  inspect  many  such  locations.  To  circumvent  this  difficulty  we  use  a  variant  of  the  demotion 
strategy,  which  we  call  the  rerating  strategy.  If  there  is  no  item  in  the  location  with  rank  j,  first 
we  seek  an  item  to  demote  from  /s  right  subtree;  if  there  is  no  such  item  we  seek  tlie  item  of 
highest  rank  in  the  left  subtree  which  we  promote.  As  before,  we  do  not  rerate  items  to  two  dif- 
ferent locations.  Further,  we  know  that  if  an  item  was  promoted  to  rank  j,  then  there  was  no 
item  in  /s  right  subtree.   And  if  no  item  is  rerated  to  location  ;  there  are  two  possibilities: 

(a)  there  was  no  item  in  the  subtree  rooted  at ;'. 

(b)  the  item  that  was  to  have  been  rerated  to  location  j  had  already  been  rerated  elsewhere. 

In  the  latter  case  we  would  already  have  encountered  the  rerated  item  on  the  search,  so  we  are 
able  to  dedde  either  to  finish  the  listing  (case  (a))  or  which  way  to  branch  (case  (b)). 

We  need  a  second  change,  for  we  may  spend  a  long  time  traversing  through  locations  storing 
no  item.  We  will  consider  the  case  where  an  item  has  been  demoted  from  rank  k  to  rank  ;'.  K 
the  result  of  a  comparison  with  the  item  at  the  location  with  rank  j  is  to  branch  right,  then  we 
might  as  well  inspect  the  item  at  k's  right  child  immediately,  as  we  will  do  so  eventually  anyway. 
We  change  the  search  tree  pointers  accordingly.  The  case  where  an  item  has  been  promoted  is 
handled  similarly. 

We  carry  out  the  splitting  of  spans  as  described  previously.  As  before,  it  only  increases  the 
size  of  the  data  structure  by  a  constant. 

Thus  we  obtain  a  data  structure  of  size  0{n)  which  allows  the  answering  of  a  semi-rectangle 
query  in  time  0(\ogn  +  s)  where  s  is  the  number  of  points  listed.  Notice  that  this  data  structure  is 
not  the  same  as  the  canal  tree  of  Chazelle.  In  fact,  this  is  the  data  structure  one  would  need  if 
one  tried  to  reduce  the  list  problem,  as  defined  at  the  start,  to  a  neighbour  problem. 

It  should  be  noted  that  McCreight's  solution  is  much  simpler  than  ours,  for  his  data  structure 
was  specifically  designed  to  solve  this  problem. 
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RepresendDg  Polyhedra 

In  [DK]  the  problem  of  detecting  if  two  polyhedra  intersect  is  studied.  They  assume  that 
one  is  given  the  polyhedra  in  preprocessed  form,  and  then  they  show  how  to  answer  the  question 
quickly.  Their  algorithm  runs  in  time  O(log-n)  on  a  representation  of  the  polyhedra  that  uses 
space  0{n~),  and  it  runs  in  time  C>(log^n)  on  a  representation  of  the  polyhedra  that  uses  space 
O(nlogn).  We  give  a  representation  for  the  polyhedra  using  space  0(n)  which  allows  a  running 
time  of  O(log-n)  for  the  same  algorithm. 

Consider  the  polyhedron  in  an  xyz  coordinate  system,  and  consider  the  xy  cross  sections 
c»rresponding  to  the  z  values  of  all  its  vertices.  These  cross  sections  together  with  the  edges  join- 
ing adjacent  cross  sections  give  a  characterization  of  the  whole  polygon.  We  define  a  drum  to  con- 
sist of  two  adjacent  cross  sections  along  with  all  their  connecting  edges.  In  this  representation  a 
polyhedron  of  n  vertices  might  require  0{n~)  storage.  The  authors  suggest  using  an  alternative 
storage  scheme  described  in  [DM],  which  uses  space  O(nlogn);  however  it  takes  time  O(log-n)  to 
retrieve  im'ormation  about  particular  aspects  (edges,  vertices,  faces)  of  particular  drums.  For- 
tunately, in  the  algorithm  pK]  give  for  detecting  polyhedral  intersections  O(logn)  operations  at 
each  iteration  are  sufficient  to  yield  the  next  edge,  vertex,  or  face. 

As  we  describe  below,  using  the  list  structure,  we  are  able  to  store  the  representation  of  a 
polyhedron  as  a  set  of  drums  in  0{n)  space,  and  yet  achieve  an  O(logn)  retrieval  time.  Further, 
in  the  algorithm  for  detecting  polyhedral  intersections  0(1)  operations  at  each  iteration  are  suffi- 
cient. 

A  drum  is  a  sequence  of  faces  and  edges.  We  represent  the  drum  by  storing,  in  a  list,  the 
sequence  of  faces,  each  face  being  represented  by  the  two  polyhedron  edges  bounding  it.  These 
edges  are  stored  as  infinite  lines.  By  keeping  the  z  values  of  the  drum's  bounding  faces  elsewhere, 
we  are  then  able  to  obtain  any  face,  edge,  or  vertex  of  the  drum. 

To  give  an  order  to  the  faces,  we  associate  an  angle  with  each  face.  This  angle  is  the  angle 
between  the  line  x=z=0  and  the  line  that  is  the  intersection  of  the  plane  containing  the  face  with 
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the  plane  z-Q.  Further,  we  give  a  direction  to  the  latter  line:  it  has  direction  corresponding  to 
traversing  the  drum  in  clockwise  order. 

We  choose  some  direction  as  the  first  direction,  and  we  are  then  able  to  consistently  order 
the  polyhedron's  faces.  We  order  each  list,  corresponding  to  a  drum,  accordingly.  This  is  analo- 
gous to  our  representation  of  a  planar  graph,  and  we  store  the  lists  in  the  same  way,  in  space 
0{n). 

The  main  iterative  step  in  the  polyhedral  intersection  algorithm  requires  the  following  opera- 
tion: given  a  segment  of  a  drum,  that  is  a  contiguous  sequence  of  k  faces  (given  by  specifying  the 
first  and  last  faces  in  the  sequence,  for  example),  find  the  k/2th  face  in  the  sequence  in  time  0(1). 
This  operation  we  cannot  do  rapidly;  however,  inspection  of  the  algorithm  shows  that  we  really 
want  that  after  OQogn)  iterations  a  drum  segment  of  length  n  has  been  reduced  to  length  0(1), 
by  repeated  partition.  This  we  can  do  in  0(1)  steps  per  iteration.  It  is  most  simply  described  by 
using  the  tree  analogy  for  storing  a  list.  If  we  have  a  segment  covering  the  range  represented  by  a 
subtree  rooted  at  ;,  together  with  the  two  nodes  representing  the  extremal  points,  we  simply  look 
at  the  face  stored  in  location  7,  if  any.  If  this  location  is  empty  we  reduce  the  range  of  the  search 
to  the  left  subtree  or  to  the  right  subtree  (plus  the  two  extremal  points),  for  one  of  these  ranges  is 
empty,  and  further  at  this  point  in  the  search  we  know  which  one  is  the  empty  one.  Otherwise  we 
proceed  according  to  the  algorithm,  which  determines  the  subsegment  to  be  eliminated. 

We  observe  that  for  each  drum  we  store  every  edge  twice.  The  reason  is  that  in  the  data 
structure  we  give  it  may  be  hard  to  traverse  a  drum  (list)  from  one  face  to  its  neighbour.  How- 
ever, it  requires  only  some  fairly  simple  modifications,  left  to  the  reader,  to  make  such  a  traversal 
easy  (0(1)  time).  Thus  we  could  store  the  drums  by  storing  their  edges  rather  than  their  faces. 

6.  Conclusions  and  Further  Research 

We  have  presented  a  scheme  for  allowing  fast  querying  of  similar  lists  while  using  minimal 
storage.    As  can  be  seen  from  the  example  of  the  3-dimensional  point  location  algorithm  the  lists 
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need  only  lie  in  a  connected  graph.  Can  the  method  be  extended  to  a  2-dimensional  grid  of  lists, 
where  the  lists  vary  slowly  in  both  directions?  Say  an  n  xn  grid,  with  lists  of  length  n,  yet  only 
requiring  0{n)  storage,  although  there  are  O(n-)  Usts?  This  could  have  applications  to  the  3- 
dimensional  point  location  algorithm  and  possibly  elsewhere. 

Another  interesting  question  is  to  what  extent  can  everything  be  made  dynamic?  Qearly, 
we  can  accomodate  changes  to  our  list  as  time  increases,  but  could  we  insert  changes  in  the  past, 
for  example?  This  would  then  allow  us  to  solve  the  dynamic  version  of  the  semi-rectangular  range 
query  problem. 

Finally,  can  we  improve  the  polyhedral  intersection  algorithm  to  run  in  time  O(logn)? 
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